Reliable MPI-IO through Layout-Aware Replication

نویسندگان

  • Seung Woo Son
  • Samuel Lang
  • Robert Latham
  • Robert Ross
  • Rajeev Thakur
چکیده

The current deployment of petascale systems and the promise of future exascale systems have created unprecedented challenges in how to manage failures in such systems. While many parallel file systems provide some sort of redundancy mechanism to cope with failures, such systems rely heavily on a hardware-based solution such as RAID. In this paper, we propose a block replication approach to store data redundantly. The approach does not depend on file system fault-tolerance mechanisms. Rather, the approach replicates each file block transparently within MPI-IO, using replicationaware datatypes. File striping information is used to place blocks from each replica in a separate storage node. We have implemented this replication mechanism in the MPI-IO layer. Our experimental results using a microbenchmark and real MPI-IO applications with PVFS and Lustre demonstrate that block replication in MPI-IO can be achieved transparently.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

RADAR: Runtime Asymmetric Data-Access Driven Scientific Data Replication

Efficient I/O on large-scale spatiotemporal scientific data requires scrutiny of both the logical layout of the data (e.g., row-major vs. column-major) and the physical layout (e.g., distribution on parallel filesystems). For increasingly complex datasets, hand optimization is a difficult matter prone to error and not scalable to the increasing heterogeneity of analysis workloads. Given these f...

متن کامل

ViMPIOS, a "truly" portable MPI-IO implementation

We present ViMPIOS, a novel MPI-IO implementation based on ViPIOS, the Vienna Parallel Input Output System. ViMPIOS inherits the defining characteristics of ViPIOS, which makes it a client-server based system focusing on cluster architectures. ViMPIOS stands out from all other MPI-IO implementations by its ”truly” portable design, which allows not only applications to be transferred between par...

متن کامل

The Design and Implementation of a MPI-Based Parallel File System

This paper presents the design of an MPI (Message Passing Interface)-based parallel file system, MPFS. MPI-IO is an extension of MPI which supports flexible logical file partition and physical file organization as well as a rich set of file access functions. MPFS enables users to specify both logical file partitions among user processes and physical file data layouts across data servers. The pr...

متن کامل

On the Implementation of a Portable, Client-Server Based MPI-IO Interface

In this paper we present the MPI-IO Interface kernel in the Vienna Parallel Input Output System (ViPIOS), which is a client-server based parallel I/O system. Compared to the already existing parallel I/O systems and libraries the concept of an independent distributed server promises to greatly enhance the usability and acceptance of the I/O system as well as the portability of client applicatio...

متن کامل

Implementing MPI-IO Shared File Pointers Without File System Support

The ROMIO implementation of the MPI-IO standard provides a portable infrastructure for use on top of any number of different underlying storage targets. These targets vary widely in their capabilities, and in some cases additional effort is needed within ROMIO to support all MPI-IO semantics. The MPI-2 standard defines a class of file access routines that use a shared file pointer. These routin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011